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THE DISTRIBUTION OF POINTS ON CURVES OVER FINITE 
FIELDS IN SOME SMALL RECTANGLES 

KIT-HO MAK 



Abstract. Let p be a prime. We study the distribution of points on a class of 

p«j I curves C over ¥p inside very small rectangles B for which the Weil bound fails 

■ ■ to give nontrivial information. In particular, we show that the distribution of 

points on C over some long rectangles is Gaussian. 



1. Introduction and statements of results 



H 

7 ! ' Let p be a large prime, and let C C A^ := A^(Fp) be an absolutely irreducible 

afSne plane curve over ¥p of degree d > 1. We identify the affine plane with the 
set of points with integer coordinates in the square [0,p — 1]^. For a rectangle 
3. B — XxJ'C-[0^p— 1]^, we define Ni3{C) to be the number of (rational) points 

on C inside B. When B = [0,p- 1]^, we wih write N{C) = N[o,p-i]2{C) for the 
number of points on C. It is widely believed that the points on C are uniformly 
distributed in the plane. That is, 

»: . . . ^ . 

^SJ , In fact, using some standard techniques involving exponential sums, one can show 

lO ' that the classical Weil bound [11] together with the Bombieri estimate [1] imply 

o : (2) mc) = N{c) ■ ^^ + oid'^iog'p), 

<^ ■ . . . ^ 

where the implied constant is absolute. If / and g are two functions of p, we write 

(3) / = nig) 

^^ I to denote the function f /g tends to infinity as p tends to infinity. In other words, 

?—( ■ (|3l) is equivalent to g = o{f). The main term of ([2]) dominates the error term when 

(4) vol(B)>p2log^+'p. 

In those cases ((!} holds. A natural and intriguing question that arises is whether 
([T]) continues to hold for smaller boxes B. However, very few is known for the 
number of points N^{C) in a small B. Indeed, given a particular small B that do 
not satisfy (jl]), we do not even know if B contains a point or not. 

One way to study Ns{C) for small B is to consider results on average. For 
instance, Chan 2^ considered the number of points on average on the modular 
hyperbola xy = c modulo an odd number q, and showed that almost all (here 
"almost all" means with probability one) boxes satisfying 

vo\{B) > 0{qi+') 
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have the expected number of points. Recently, Zaharescu and the author [6] gen- 
erahzed the result of Chan to all curves over ¥p. 

Another result of similar sort with only one moving side for C being the modular 
hyperbola was obtained by Gonek, Krishnaswami and Sondhi ^. In our language, 
they showed that ii B — {x,x + H]x J' with H very small and J^ of size comparable 
to p, then the numbers of points inside B exhibit a Gaussian distribution when we 
move the box B horizontally. A Gaussian distribution is also obtained by Zaharescu 
and the author [8 in a similar situation. More precisely, we show that under some 
natural conditions, for C, B as above, and if at least one of the character x, ip is 
nontrivial, the projections of the values of the hybrid exponential sum 

(5) ^- E xiaim^ifim 

Pi£Cr\B 
to any straight lines passing through the origin exhibit a Gaussian distribution when 
we move B horizontally. We note that when C is the afhne line, a two-dimensional 
distribution of S is obtained by Lamzouri [5]. 

The aim of this paper is continue the study of Nis{C) for small rectangle B. In 
particular, we show that for a large class of curves C, the distribution of Ns{C) for 
the B above is Gaussian. Our first step is to study the patterns of points on curves, 
which is crucial for our study of N^{C) and may be of independent interest. 

The study of patterns was first introduced by Cobeli, Gonek and Zaharescu [3], 
where they get results for the distribution of patterns of multiplicative inverses 
modulo p. We generalize their definition of patterns to curves by viewing the 
patterns in [3] as patterns on the two coordinates for the curve xy = 1. For any 
positive integer s, let a = (ai, . . . , as), b = (6i, . . . , hg) be two vectors so that all 
Oi's are coprime to p, and all aj~ 6i, . . . ,a~^hs are distinct modulo p. Define an 
(a, h)-pattern to be an s-tuple of points (Pi, . . . , Ps), where each Pi is of the form 
(aiX + bi,yi) for some x. As in j3|, we may further restrict all the yi to lie in a 
specific interval J as we see fit. 

In the case of the modular hyperbola in [3^, \i J = [0,p — 1] the number of 
patterns is just p — 1, since each x corresponds to exactly one y on the curve. 
However, for a general curve C and any vector a, b, we do not know a priori that 
even one pattern exists, since the two coordinates will not in general corresponds 
bijectively. Nevertheless, we are able to estimate the number of patterns for a large 
class of curves. Let P{I, J) := Pa.,h{C; I, J') be the number of patterns with x E I 
and all y-coordinates lie in ^J, then we have the following. 

Theorem 1. Let C be a plane curve given by the equation f{x,y) = 0. Let 

(6) TT : C-> A\ {x,y) ^ x 

be the projection of C to the first coordinates over ¥p. Suppose there is an x E ¥p so 
that TT ramifies completely, and let a = (ai, . . . , Os), b = (&i, . . . , 6s) be two vectors 
so that all a.i 's are coprime to p, and all a^ 6i, . . . , a'^^bs are distinct modulo p, 
then 

l<^l \ , ^/ ,2s ^i„„s+l 



In the case X = [0,p — 1], the error term can be slightly improved. 
P{[0,p^llJ)^p(W\ +Oid^^^log^p). 
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Note that our estimation for the number of patterns is independent of a and b. 

We are now ready for the study of distribution of Nb{C) for small B. We fix an 
interval J^ C [0,p — 1], and let N = \J'\. For any H > (which may depends on 
p), let Bx = {x,x + H] X J . From now on, we will assume the following condition. 

(7) For any given x, there is at most one y so that (x, y) <^ C r\ J. 

This is the same condition we imposed in 18] when Zaharescu and the author study 
the distribution of hybrid exponential sums over curves. 
Define 



(8) Ah{H)^^{N,sAC) 

to be the fc-th moment of the number of points in C \^ Bx about its mean. We 
also define fJ-k{H, P) to be the fc-th moment of a binomial random variable X with 
parameter H and P, i.e. 

(9) ^lk{H, P) := E{{X ~ HPf) = f] (f )^'(1 " P)''-''{h - HPf. 

h=l ^ ^ 

We estimate the moment Mk{H) using the binomial model with parameter H and 

N/p. 

Theorem 2. Fix a positive integer k. Let C he a curve satisfying the assumptions 
in Theorem[l\ and the additional condition ([T]). Set Bx, H, N as above, we have 

Mk{H)=pv{H,N/p) + Ofe(d2'=ff'=v^ log'p). 

For a fixed fc, it is well-known (see Montgomery and Vaughan |10j . Lemma 11) 
that 

^lk{H,P)<^{HPf'^ +HP 

uniformly for < P < 1 and H = 1,2,3,.... Therefore, Theorem [2] immediately 
implies the following. 

Corollary 3. Assumptions as in Theorem\^ For any fixed k, we have 

MkiH) «fc p{HN/pf' + HN/p + d^'^H'^^log'^p. 

Remark 1.1. For the case of curves, [6] Theorem 2] gives an upper bound for the 
second moment of Nb{C) when B is allowed to move freely on the plane. That 
theorem implies M2(i?) < PtJ-2iH, N/p). Since H2iH, P) = HP{1 - P), Theorem[2] 
shows that 6, Theorem 2] has the correct main term, and therefore is best possible 
for the case of curves (with suitable H and N). 

Let 

{1 • 3 • . . . • (fe — 1) , fc even, 
, fc odd, 

then (see [lOj Lemma 10]) 

iXk{H,P) = {vk + o{l)){HP{l-P)f'^ 

as HP{1 — P) tends to infinity. From this and Theorem[2]we obtain the following. 
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Corollary 4. For any fixed k, if H — o i -^^ — 1 and {HN/p){\ — N/p) —> oo as 
p tends to infinity, then 

f HN ( iV^ ^ ''^^ 



V 

In particular, when N ~ cp, < c < 1 and logH / \ogp -^ as p tends to infinity, 
the distribution of Nq^ (C) tends to a Gaussian distribution with mean HN /p and 
variance {HN/p){l — N/p). 

Remark 1.2. If condition ([7]) does not hold, we may still have Gaussian distribution 
for the Nb^{C). For example, if C is a hyperelliptic curve, and choose J to be the 
interval {—ap, ap\ for some < a < 1/2, then generically one a;-coordinate on 
the curve corresponds to two y-coordinates. From Corollary 21 we have Gaussian 
distribution for Ji — [0,ap], and also for J2 — [— ap, 0], with the same mean and 
variance. After combining the two of them we will have Gaussian distribution for 
the whole interval J . 

2. Preliminary lemmas 

In this section we collect all the preliminary lemmas that will be used in the 
subsequent sections. The first lemma is the Weil bound for space curves. For a 
proof, see [9l Theorem 2.1]. 

Lemma 2.1. Let C be an absolutely irreducible curve in the affine r -space KZ, of 
degree d > 1, which is not contained in any hyperplane. Let B ^ Ti x . . . x Xr be a 
box, then 

Nb{C) - p ■ ^^ + 0{d'^\og'p), 
where t is the number of intervals Xi that are not the full interval [0,p — 1]. 

The next lemma states that if we translate a set in Fp a small number of times, 
it will always reach a new element. This lemma allows us to show later that some 
curves are absolutely irreducible. 

Lemma 2.2. Let r > 2, xi,...,Xr G Fp 6e r distinct elements. Suppose M 
is a nonempty finite subset of the algebraic closure Fp with A\M\ < p~ . Then 
there exists a j € {1, . . . ,r} such that the translate M + Xj is not contained in 
Ui^j(7W +Xi). 

Proof. Suppose (xi, . . . ,Xr,M) provides a counterexample to the statement of the 
lemma. Then it is clear that for any nonzero t e Fp, the tuple (txi, . . . , txr, tAA) is 
another counterexample. 

By Minkowski's theorem on lattice points in a convex symmetric body, there 
exists a nonzero integer t such that 

<p-l 

<{p-l)-i 



<{p-ir-^. 





' 


t\ 


< 




p 

tXr 




L 


P 
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Thus there are integers yj such that 

I j/j = txj (mod p) 

for any j £ {1, . . . , r}, and (yi, . . . , y^i iA^) provides a counterexample. Now let 
jo be such that |j/j,j| = maxi<j<r \yj\- Choose a £ tAi and consider the set Ai = 
tAi n (a + Fp). Then (j/i, . . . , yr,Ai) will also be a counterexample. 

Note that a + ¥p can be written as a union of at most |A^| intervals (i.e. subsets 
of ¥p consisting of consecutive integers or its translate in Fp) whose endpoints are 
in A4. Let {a + a,a + a + 1, . . . ,a + b} be the longest of these intervals. Then 

^-\M\-\M\ 
By this, (fTO)) and the hypothesis 4 \M\ < p~ , we have 

|6-a|>V-^>2|2/,J. 

Now if j/jQ > 0, then a + a + yj^ belongs to A^ + yj^ but does not belong to 
Lii^jg{M + yi), while if yj^ > 0, then a + b + yjg belongs to A^ + yj„ but does 
not belong to Ui^jg{M + yt)- This contradicts the fact that {yi, . . . ,yr,A4) is a 
counterexample, and completes our proof. D 

Recall that the Stirling number of second kind, S{r, i), is by definition the number 
of partition of a set of cardinality r into exactly t nonempty subsets. The proof of 
the following lemma can be found in \TU\. 

Lemma 2.3. Let fik{H,P) be defined by ([9]), then 

3. Patterns of curves: Proof of Theorem [T] 

Let C be a plane curve given by the equation f{x, y) ~ and two vectors 
a = (ai, . . . , Us), b = (6i, . . . , 6^) so that p| a^ and aj~ bi, . . . , aj^bg are all distinct 
modulo p, we define the x-shifted curve Ca,b to be the space curve in the affine 
(s + l)-space with varaibles x^yi, . . . ,yr and equations 

(11) f{a^x + b,,y,)=0,yl<^<s. 

It is not difficult to see that Ca,b is indeed a curve, and its degree is less than or 
equal to d^ . Note that similar constructions appeared in [71 |H1 IS] ■ 

It is clear from the defining equations pT|) that a point on Ca,b corresponds to 
an (a, b)-pattern on C, i.e. 

Pa,b(C,I,^)=A^B(Ca,b), 

where B = I x {JY . We want to show that Ca,b is absolutely irreducible. Cur- 
rently we are not able to prove this for all curves C, but we are able to show the 
irreducibility for the class of curves so that the projection tt defined by ^ has a 
completely ramified point. 

Proposition 3.1. IJ C satisfies the assumptions in Theorem]^ then Ca,b 'is abso- 
lutely irreducible. 
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Proof. For 1 < j < s we define Cj to be the curve given by the first j equations in 
(HH), i.e. 

f{aiX + hi, yi) =0, VI < i < j. 

We have a chain of coverings of curves, 

C'a.b = Cs — ?► Cs~l ^- . . . ^- Ci = C, 

where each arrow represent a projection tt^ given by (x, j/i, . . . , yi) i— > (a;, j/i, . . . , yi-i). 
Let 5 C Fp be the set of completely ramified points for the map tt : C — > A^ . Since 
all the Xi = bia~ are distinct, we can apply Lemma l2.2l with Xi = bia~^ to conclude 
that there are new completely ramified points in each level of the above chain of 
coverings. Since C is absolutely irreducible, this shows that Ca,b is also absolutely 
irreducible. D 

We are now ready to prove Theorem [TJ By Proposition 13.11 if C satisfies the 
assumptions in the theorem, then Ca,b is absolutely irreducible in A^"^^. Theorem 
[1] now follows easily from Lemma 12.11 

4. Estimation of Mk{H): Proof of Theorem [2] 
Using the binomial theorem to expand the right hand side of ([5]), we obtain 



M.(i/) = ^^ iV.JC) 



x=Or=0 ^ ^ \ ^ 

r=Q ^ / ^ ^ / a:=0 

Here we make the convention that if r = 0, Nq^{CY = 1 even when Nb^{C) = 0. 
Define 

x=0 

Clearly So{H) — p (by our convention). For r > 1, we have 

(12) sah)=j: j: ... Y. i-E E 1- 

{xi ^...^Xj^}C.(x,x-\-H] 

For each 1 < i < r , let Xi — x + ai, and let A be the set of distinct a^'s. Set |^| = t. 
We have A C {1,2,..., H}. From the definition of the Stirling number of second 
kind, we see that for any given A, the number of sets with {xi, . . . ,Xr} = ^ is 
S{r,t)tl. Grouping the terms in (IT^ according to different values of t, we obtain 

r p— 1 

(13) Sr{H) = Y,S{r,t)t\ J2 J2 E 1- 

t=l \A\=t x=0(x+bi,yi)eC,l<i<r 

AC[1,H] y,&J 

By condition (O, the inner sum 

p-i 

E E 1 

x=0 (x+bi,yi)£C.l<i<r 

yiej 
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is the number of (a, b)-pattcrn of C with a=(l,l,...,l), bis any t-tuple ordering 
of the set A, and all y coordinates lie in J'. By Theorem [l] this sum is 



p-i 



x=0 (x+bi,yi}eC,l<i<r 






Put this into p3)) yields 

t=l \A\=t 

AC[1,H] 



pL 



:p^5(r,i)t!(f)(^y + 0('^5(r,t)i!(^)d-V^Wp 



Therefore, 



We can insert the terms with t — without altering the sum since 5(r, 0) = for 
any r > 1 (and for r = the inner sum is understood to be zero), thus we may 
apply Lemma 12.31 with P — N/p to conclude that 

Mk{H)^pv{H,Nlp) + Ofc(d2fcii-fc^log'=p). 

This completes the proof of Theorem [5] 
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